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Abstract 


Detailed  maintenance  planning  under  uncertainty  is  one  of  the  most  important  topics 
in  military  research  and  practice.  As  one  of  the  fastest  ways  to  recover  failed  weapon 
systems,  cannibalization  operations  are  commonly  applied  by  maintenance  personnel. 
Due  to  additional  complexities  introduced  by  these  operations,  detailed  maintenance 
decision  making  with  cannibalization  was  rarely  studied  in  the  literature.  This  report 
proposed  an  analytic  model  for  making  repair  decisions  in  a  multi-stage  uncertain 
environment  at  the  operational  level,  where  cannibalization  operations  are  allowed 
and  repair  lead  times  are  random.  The  study  addresses  the  problem  of  maintenance 
planning  for  military  systems  with  random  lead  times  and  independent  failures. 
The  objective  of  the  problem  is  to  maximize  fleet  reliabilities  under  operating  costs 
constraints.  A  complementary  problem  that  minimizes  total  operating  costs  under 
fleet  reliabilities  constraints  was  also  examined.  A  polynomial  algorithm  was  proposed 
to  solve  the  minimization  problem  and  determine  optimal  decision  strategies.  This 
algorithm  could  be  used  as  a  subroutine  in  a  binary-search  algorithm  to  solve  the 
maximization  problem.  The  obtained  solutions  were  proved  to  be  controllable  in  such 
a  way  that  solutions  with  designated  approximation  ratios  were  achievable  by  running 
the  algorithm  in  predictable  run  times. 

Significance  for  defence  and  security 


Under  peaceful  conditions,  operating  mangers  in  the  Canadian  Armed  Forces  (CAF) 
are  encouraged  to  dynamically  find  out  best  decisions  for  maintenance  systems ,  where 
the  word  of  "best"  refers  to  the  balance  between  operating  costs  and  fleet  reliabilities. 
As  one  of  the  fastest  operations  to  recover  a  weapon  system,  especially  for  critical 
parts,  cannibalization  is  widely  used  by  operating  managers.  For  example,  according 
to  the  Canadian  Army  Divestment  Plan,  the  CAF  National  Procurement  reductions 
will  constrain  the  CAF  to  prematurely  divest  up  to  half  of  its  Heavy  Logistics  Vehicle 
Wheeled,  Light  Support  Vehicles  Wheeled,  and  BandVangn  to  maintain  operational 
readiness  to  meet  CAF  missions.  This  report  developed  an  optimization  model  to  find 
the  best  maintenance  decisions  with  cannibalization  and  presented  solution  approaches 
to  the  model.  Operating  managers  would  use  the  model  to  make  repair  decisions  at 
the  operational  level  over  a  finite  number  of  time  periods. 
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Resume 


La  planification  de  l’entretien  detaille  en  contexte  d’incertitude  est  Tun  des  plus 
importants  sujets  dans  les  domaines  de  la  recherche  et  de  la  pratique  militaires.  L’une 
des  pratiques  les  plus  con  ram  merit  utilisees  par  le  personnel  de  maintenance  pour 
reparer  des  systemes  d’arme  defectueux  est  la  cannibalisation.  En  raison  des  difficultes 
que  comporte  ce  procede,  le  recours  a  la  cannibalisation  comme  pratique  d’entretien 
detaille  a  rarement  fait  l’objet  d’etudes.  Dans  le  present  rapport,  on  propose  un 
modele  analytique  pour  la  prise  de  decisions  operationnelles  concernant  la  reparation 
d’equipement  en  plusieurs  etapes  et  en  contexte  d’incertitude  lorsque  la  cannibalisation 
est  autorisee  et  les  delais  de  reparation  sont  variables.  L’etude  aborde  la  question  de 
la  planification  de  l’entretien  des  systemes  militaires  avec  des  delais  aleatoires  et  des 
defaillances  independantes.  L’objectif  est  de  maximiser  la  habilite  des  flottes  malgre  les 
contraintes  liees  aux  couts  d’exploitation.  On  s’est  egalement  penche  sur  le  probleme 
complementaire  de  la  minimisation  des  couts  d’exploitation  avec  des  contraintes  sur  le 
plan  de  la  habilite.  Pour  resoudre  ce  probleme  et  tro uver  des  strategies  pour  la  prise  de 
decisions  optimales,  on  a  propose  un  algorithme  polynomial.  Cet  algorithme  pourrait 
aussi  etre  utilise  comme  sous-programme  dans  un  algorithme  de  recherche  binaire  ahn 
de  resoudre  le  probleme  de  la  maximisation  de  la  habilite.  Les  resultats  obtenus  se 
sont  averes  controlables,  de  sorte  que  les  solutions  ayant  un  rapport  d’approximation 
etaient  rcalisables  en  executant  l’algorithme  pendant  une  duree  previsible. 


Importance  pour  la  defense  et  la  securite 


En  contexte  de  paix,  on  encourage  les  gestionnaires  de  l’exploitation  des  Forces  armees 
canadiennes  (FAC)  a  prendre  de  fagon  dynamique  les  meilleures  decisions  possible 
relativement  aux  systemes  d’entretien.  Prendre  les  meilleures  decisions  possible  signihe 
ici  de  trouver  un  equilibre  entre  les  couts  d’exploitation  et  la  habilite  de  la  hotte.  La 
cannibalisation  est  tres  souvent  utilisee  par  les  gestionnaires  de  l’exploitation,  car  c’est 
l’une  des  methodes  les  plus  rapides  pour  remettre  un  systeme  d’arme  a  neuf,  en  parti- 
culier  lorsqu’il  s’agit  du  remplacement  de  pieces  essentielles.  Par  exemple,  selon  le  Plan 
de  dessaisissement  de  l'Armee  canadienne,  les  reductions  dans  l’approvisionnement 
national  contraindront  les  FAC  a  se  departir  prematurement  de  pres  de  la  moitie  de 
ses  vehicules  logistiques  lourds  a  roues,  de  ses  vehicules  de  soutien  legers  a  roues  et  de 
ses  BandVagn  pour  maintenir  leur  preparation  operationnelle  en  vue  d’accomplir  leurs 
missions.  Dans  le  present  rapport,  on  present-e  un  modele  d’ optimisation  appuyant 
la  prise  des  meilleures  decisions  possible  en  ce  qui  a  trait  a  l’ent-retien  au  rnoyen 
de  la  cannibalisation.  On  presente  egalement  les  demarches  employees  pour  trouver 
des  solutions  a  l’aide  du  modele.  Les  gestionnaires  de  l’exploitation  feront  usage 
de  ce  modele  pour  prendre  des  decisions  operationnelles  concernant  la  reparation 
d’equipement. 
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1  Introduction 


1.1  Background 

Ill  a  military  maintenance  network,  when  a  weapon  system  or  prime  equipment  (PE) 
is  malfunctioning,  one  or  more  responsible  parts  are  identified  and  replaced.  The 
failed  parts  will  be  separated  from  the  PE  and  sent  to  a  central  depot  for  repair. 
The  left  holes  are  supposed  to  be  filled  with  functioning  parts.  If  spare  parts  are  not 
available,  cannibalizing  functioning  parts  from  other  failed  PEs  is  sometimes  applied 
in  practice.  Figures  1  and  2  describe  a  cannibalization  example.  The  example  has  a 
fleet  with  three  PEs:  PE-1,  PE-2  and  PE-3.  Each  PE  is  made  up  of  three  parts/LRUs 
(line  replaceable  units)  in  three  different  types:  L-l,  L-2  and  L-3.  In  the  figures,  if 
an  LRU  is  functioning,  it  is  denoted  by  a  solid-double-linc  rectangle;  otherwise  a 
dashed-doublc-line  rectangle  is  used.  Figure  1  has  PE-1,  PE-2  and  PE-3  waiting  for 
L-3,  L-2  and  L-l  replacement,  respectively.  The  two  arrows  indicate  that  PE-1  and 
PE-3  can  cannibalize  L-3  and  L-l  from  PE-2,  respectively.  Using  cannibalization 
operations,  PE-1  and  PE-3  can  get  back  to  work  immediately,  while  PE-2  will  be  in 
an  even  worse  situation,  i.e. ,  there  are  three  LRU  "holes"  on  PE-2.  Figure  2  shows 
the  status  of  the  fleet  after  cannibalization.  It  is  clearly  observed  that  the  number 
of  functioning  PEs  increases  from  zero  (Figure  1)  to  two  (Figure  2).  Certainly,  the 
reliability  level  of  the  fleet  is  improved. 


Figure  1:  PE-1  and  PE-3  can  cannibalize  L-3  and  L-l  from  PE-2,  respectively. 


Figure  2:  PE-1  and  PE-3  are  recovered  after  cannibalization. 


This  example  demonstrates  that  cannibalization  is  a  useful  operation  when  there  is  no 
sufficient  spare  LRU;  this  operation  is  favored  by  maintenance  personnel  in  practice. 
In  a  study  by  General  Accounting  Office  of  the  United  States  (GAO  U.S.)  [1],  Air 
Force  and  Navy  units  reported  a  total  of  about  850,000  cannibalizations  on  aircraft 
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in  fiscal  years  1996-2000,  including  376,000  cannibalizations  by  the  Air  Force  and 
468,000  by  the  Navy.  For  the  aircraft  Bl-B  of  the  Air  Force,  the  cannibalization  rate 
(the  number  of  cannibalizations  per  100  sorties)  could  be  up  to  85.4.  For  the  aircraft 
F-14D  of  the  Navy,  the  rate  (the  number  of  cannibalizations  per  100  flight  hours) 
could  be  up  to  32.8.  As  such,  cannibalization  has  been  one  of  the  most  important- 
topics  in  military  research  and  practice  since  the  1970s. 

By  US  DoD  (the  Department  of  Defense,  United  States)  [2],  there  were  14,800  aircraft, 
896  strategic  missiles,  256  ships  and  386,600  ground  combat  and  tactical  vehicles 
under  the  2013  daily  maintenance  operations.  The  base  of  PE  failure  is  huge  such 
that  a  few  number  of  malfunctioning  PEs  won’t  make  substantial  differences  on 
actually  failure  rates.  Based  on  this,  a  large  number  of  literature  assumed  constant 
failure  rates  and  therefore  they  were  able  to  apply  queue  theory  on  maintenance  and 
reliability  study  [3-7].  In  the  CAF  reality,  however,  the  number  of  PEs  is  usually 
very  small  (e.g.,  the  Royal  Canadian  Navy  operates  four  submarines,  three  destroyers 
and  two  replenishment  vessels).  Any  failed  PEs  might  cause  substantial  decreasing  on 
the  actual  base  of  PE  failures.  This  gives  dynamically  varied  failure  rates,  i.e. ,  the 
rate  of  PE  failures  in  a  time  period  depends  on  the  number  of  functioning  PEs  at- 
t-he  beginning  of  the  period.  Using  the  theory  of  Markov  decision  process  (MDP), 
Zhang  [8]  presented  a  multi-stage  model,  in  which  detailed  cannibalization/repair 
decisions  were  made  in  each  time  period/stage  to  maximize  total  weighted  number 
of  functioning  PEs  over  a  finite  number  of  decision  periods.  As  a  follow-up  study, 
this  report  considers  independent  PE  failures;  that  is,  from  the  Probability  Theory 
perspective,  the  behaviors  of  PE  failures  during  each  time  period  are  assumed  to  be 
independent  and  identically  distributed  (HD)  random  variables. 

1.2  Aim 

Operating  managers  in  CAF  are  required  to  follow  a  defined  decision  policy  under 
wartime  conditions.  However,  under  peaceful  conditions  they  are  encouraged  to  dy¬ 
namically  find  out  which  decision  is  best,  where  the  word  of  "best-"  indicates  the 
balance  between  "reasonable"  operating  costs  and  "acceptable"  fleet  reliabilities.  The 
objective  of  this  report  is  to  develop  a  model  to  determine  optimal  decision  strategies 
for  maintenance  planning  of  military  systems  with  lead  times  and  cannibalization. 

1.3  Scope 

There  are  two  major  reasons  for  undergoing  cannibalization  [1]  -  inefficient  support 
system,  and  high  readiness  requirement  and  operational  demands.  This  indicates  that 
most  cannibalization  operations  are  driven  by  the  mismatch  between  LRU  demands 
and  supplies.  As  an  extreme  case,  LRU  supply  can  be  discontinued  (i.e.,  no  external 
supplies  for  LRUs).  For  example,  one  of  the  ongoing  projects  in  CAF  is  to  cannibalize 


2 


DRDC-RDDC-2014-R1 65 


its  Eryx  missile  systems  as  it  prepares  for  the  eventual  retirement  of  the  weapon  in 
2016  [9].  This  report  models  this  particular  case  by  assuming  that  the  maintenance 
network  does  not  include  any  third-party  LRU  supplies/demands;  LRUs  can  come  to 
(or  leave)  the  network  only  when  they  are  installed  on  some  PEs  and  these  host  PEs 
come  to  the  network  as  failed  PEs  (or  leave  it  as  functioning  PEs). 

In  most  situations,  a  maintenance  network  is  comprised  of  one  or  more  operating 
bases  to  hold  all  PEs  and  spare  LRUs  and  a  central  depot  to  conduct  all  repair 
operations.  Since  the  central  depot  is  usually  located  in  a  separate  site  from  the 
operating  bases,  base-depot  transportation  and  depot-repair  operations  are  out  of 
control  of  operating  managers.  This  report  considers  repair  times  and  transportation 
costs  as  uncertain  factors,  i.e. ,  repair  lead  times  and  transportation  costs  are  not 
available  until  the  times/periods  come.  Assuming  that  all  base  operations  (i.e.,  LRU 
separation,  installation  and  warehousing  operations)  are  controlled  by  operating 
managers,  this  report  focuses  on  repair  decisions  to  answer  the  question:  how  many 
failed  LRUs  should  be  sent  to  the  depot  for  repair  at  each  time  period. 

Traditionally,  maintenance  planning  problems  of  systems  with  IID-featured  PE  failures 
are  studied  using  Queuing  Theory  [3,4,10].  However,  this  method  cannot  be  used 
for  detailed  operational  decision  making.  In  this  report,  stochastic  optimization 
approaches  [11,12]  are  used  to  formulate  and  solve  the  maintenance  planning  problem 
as  a  multi-stage  decision  making  problem. 

1.4  Structure 

This  report  is  organized  as  follows.  Section  2  presents  a  literature  review  for  canni¬ 
balization,  a  description  of  the  maintenance  system,  and  a  formal  definition  of  the 
problem  for  maximizing  fleet  reliabilities.  In  Section  3,  in  order  to  solve  the  maximiza¬ 
tion  problem,  a  complementary  problem  that  determines  a  set  of  multi-stage  repair 
decisions  for  minimizing  total  operating  costs  with  constraints  on  fleet  reliabilities 
is  constructed.  A  dynamic  programming  algorithm  is  then  developed  to  solve  the 
minimization  problem.  In  Section  4,  a  binary-search  algorithm  is  developed  to  provide 
approximation  solutions  to  the  maximization  problem.  Finally,  conclusions  and  future 
research  are  presented  in  Section  5. 
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2  Preliminaries  and  MaxRho 


2.1  Literature  review 

Manufactured  products  can  fail  due  to  different  processes  such  as  corrosion,  wear 
and  tear,  and  fatigue.  Products  such  as  domestic  electronics  and  appliances  are 
generally  discarded  and  replaced  upon  failure  because  they  are  inexpensive.  However, 
capital  goods  such  as  defense  weapon  systems  (or  PEs)  are  repaired  because  of  their 
high  replacement  costs.  Repair /maintenance  decisions  often  involve  the  removal  and 
replacement  of  the  failed  parts.  Research  on  maintenance-decision-making  problems 
have  spawned  several  noteworthy  papers  in  the  open  literature  over  the  last  five 
decades,  starting  with  the  textbook  [13]  (which  fundamentals  of  maintenance  and 
reliability  research  methods  were  established)  and  the  paper  [14]  (important  earlier 
developments  in  this  research  area  were  reviewed) .  Most  research  works  before  the 
middle  of  1980’s  were  reviewed  in  the  survey  papers  [15-17].  Later,  Valdez-Flores 
and  Feldman  [18]  and  Cho  and  Parlar  [19]  reviewed  the  research  works  for  single 
and  multiple  items  maintenance  decision  making,  respectively.  For  more  recent 
developments,  the  review  papers  [20],  [21]  and  [22]  surveyed  studies  on  corrective, 
preventive  and  opportunistic  maintenance  policies,  respectively. 

In  the  military  context,  the  main  points  of  interest  are  the  level  of  repair  analysis 
(LORA)  problems  (i.e. ,  the  relocation  decisions  of  repair  facilities  on  a  support 
network  [23,24]),  the  spare  parts  stocking  (SPS)  problems  (i.e.,  the  initial  inventory 
decisions  of  spare  parts  on  a  support  network  [25,26])  and  the  combined  LORA-SPS 
problems  on  a  support  network  [27].  In  these  problems,  repair  decisions  were  either 
studied  at  the  strategic  level  using  aggregated  approaches  (e.g.,  LORA  or  LORA-SPS), 
or  even  not  considered  at  all  (e.g.,  SPS).  However,  operating  managers  are  under 
increased  pressures  to  improve  fleet  reliabilities  through  their  detailed  (or  daily)  repair 
decisions.  Thus,  there  is  a  need  to  study  multi-stage  repair  decision  making  for 
military  maintenance  networks  at  the  operational  level. 

Detailed  operational  planning  in  a  multi-stage,  uncertain  environment  is  one  of 
the  most  important  and  difficult,  topics  in  operations  research.  With  respect  to 
different  uncertainty  features,  two  major  solution  approaches  are  considered:  (1)  the 
stochastic  dynamic  programming  (SDP)  approach  [28],  in  which  a  decision  tree  is  first 
constructed  and  then  the  tree  is  solved  backwards,  is  a  common  method  to  solve  MDP 
or  MDP-featured  problems,  and  (2)  the  scenario  tree  based  (STB)  approach  [29],  in 
which  a  deterministic  scenario  tree  formulation  is  first  presented  and  then  a  dynamic 
programming  algorithm  is  developed  to  find  optimal  solutions  on  the  scenario  tree. 
Since  the  maintenance  systems  studied  in  this  report  have  IID-featured  PE  failures,  the 
structure  of  scenario  tree  used  in  the  formulation  can  be  easily  determined  in  advance. 
This  report  adopts  the  STB  approach  for  later  algorithm  design  and  analysis.  Note 
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that  the  SDP  approach  was  used  in  [8]  for  maintenance  systems  with  MDP-featured 
PE  failures. 


2.2  System  description 

The  report  considers  a  single-base,  single-depot  maintenance  network,  which  is  de¬ 
scribed  in  Figure  3.  In  the  network,  all  failed  PEs  and  spare  LRUs  are  installed  in 
the  operating  base  and  all  repair  operations  are  conducted  in  the  depot  (assuming 
that  the  warehouse  space  in  the  base  and  the  repair  capacity  in  the  depot  are  un¬ 
limited).  Initially,  there  are  N  (0  <  N  <  oo)  failed  PEs  ,Ejy)  installed  in 

the  base  and  each  PE  is  made  up  of  M  (1  <  M  <  oo)  distinct  LRUs  ,  Lm)- 

Note  that  among  these  M  LRU  positions  there  is  at  least  one  missing  LRU  hole  or 
malfunctioning  LRU  for  each  failed  PE. 


Operating  Base 


Depot 


Figure  3:  The  support  network  has  one  operating  base  and  one  repair  depot. 


Let  fm  and  gm  be  the  number  of  functioning  and  malfunctioning  Lm  in  the  base, 
respectively,  where  rn  —  1,2, , M.  Let  qm  be  the  number  of  due-in  Lm,  which  are 
either  under  repair  in  the  depot  or  in  transshipment  between  the  base  and  the  depot. 
Since  they  are  not  located  in  the  base,  they  are  not  included  in  either  fm  or  gm.  Thus, 
the  total  number  of  spare  (or  individually  existed)  Lm  can  be  calculated  as: 

Sm  =  fm  T  gm  T  Qm  dV ,  m  =  1,  2,  •  •  ■  ,  M .  (1) 

The  use  of  words  "individually  existed"  is  due  to  the  observation  that  each  PE  should 
include  a  PE  frame,  on  which  LRUs  are  installed.  When  the  extra  LRUs  are  first 
introduced  in  the  system,  they  are  not  attached  to  any  PE  frame.  Assuming  that  there 
are  no  external  supplies  or  demands  for  any  individual  LRUs,  sm  for  all  m  —  1, 2,  •  •  •  ,  M 
remain  constant  during  the  whole  decision  horizon. 

A  system  is  called  effective  if  all  failed  PEs,  whose  Lm  demands  can  be  satisfied  using 
either  the  stocked  Lm  or  the  functioning  Lm  on  other  failed  PEs  (via  cannibalization), 
are  recovered.  Given  that  all  in-base  operations  are  well-controlled  by  operating 
managers,  PE-recovery  operations  can  be  treated  as  zero-time  and  zero-cost  operations. 
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Assuming  that  the  initial  system  is  effective,  N  >  0  implies  that  there  is  at  least  one 
failed  LRU  (say  Lm)  such  that  fm  =  0,  i.e. ,  min{/i,/2, ■  •  ■  ,/m}  =  0. 

Let  [1  ,T  +  1),  where  1  <  T  <  oo,  be  the  decision  horizon  including  T  mutually  disjoint 
time  periods.  For  instance,  Figure  4  depicts  the  repair  decisions  and  PE  failures  of 
period  t  (where  1  <  t  <  T),  i.e.,  all  repair  decisions  are  assumed  to  be  made  at  the 
beginning  of  period  t  (just  after  time  t )  and  all  PE  failures  are  assumed  to  be  realized 
at  the  end  of  period  t  (just  before  time  t  + 1). 

Decisions  are  made.  Failures  occur. 

J_ h 

t  t+1 

Figure  4:  The  repair  decisions  and  PE  failures  in  period  t. 

Let  {^,g/qt  , ??,*}  denote  the  system  status  at  time  t,  where  f*  =  (/}, /|, •  , /m),  g /  = 

(91,92,- ■■  ,9m),  q*  =  (9i,92  >■■■  ,9m)  and  nL  =  fin  +  gin  +  (lm  -  sm  with  m  =  1, 2,  ■  •  ■  ,  M. 
As  a  consequence  of  the  effective-system  assumption,  the  PEs,  which  are  recovered 
during  period  t,  are  assumed  to  leave  the  system  by  the  end  of  period  t  (i.e.,  just  before 
time  t  +  1  in  the  figure).  Thus,  at  time  t  +  1  only  the  number  of  un-covered/failed 
PEs  is  recorded,  which  is  denoted  by  nt+1  with  ?it+1  =  +  g^1  PQm1  ~  sm ,  Vm. 

In  order  to  keep  notation  consistency,  the  initial  status  at  time  one  can  be  re-denoted 
by  the  1-superscripted  notations,  i.e.,  =  fm,  g]n  =  gm,  q ^  =  qm,  and  n1  =  N. 

2.3  Failures  and  repairs 

Assume  that  IID-featured  PE  failures  follow  the  same  Homogeneous  Poison  process 
(HPP)  with  mean  A  >  0  in  all  periods.  By  HPP,  the  probability  of  PE  failures  with  k 
failed  PEs  can  be  written  as: 

\  k  —X 

— J3— ,  where  k  =  0,1,-  .  (2) 

k\ 

Let  0  <  7  <  1  be  the  threshold  probability  such  that  PE  failures  with  probabilities 
smaller  than  7  would  be  considered  as  zero-probability  events.  Considering  a  reasonable 
7  value  (i.e.,  7  -C  1  is  a  relatively  small  value  compared  to  value  1),  HPP-distributed 
PE  failures  have  smaller  probabilities  as  k  (the  number  of  failed  PEs)  gets  larger.  Let 
K  be  the  largest  integer  value  such  that  ^  /i, —  >  7.  This  implies  that  PE  failures 
with  more  than  K  failed  PEs  are  considered  as  zero-probability  events.  Using  this 
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assumption,  the  PE-failure  probabilities  described  in  Equation  (2)  can  be  modified  as: 


Xke~x 

Prob(fc)  =  k' - ,  where  k  =  0, 1,  •  •  •  ,K.  (3) 

v'  \k'  e~x 
k'\ 


It  is  assumed  that  there  is  one  and  only  one  responsible  LRU  for  the  PE  failure  with 

exactly  one  failed  PE.  Let  f3m  G  [0, 1]  be  the  probability  of  such  a  PE  failure  caused 

M 

by  a  failed  Lm.  Thus,  E  (3m  =  L  Let  k  =  (/ci, fo, ■  , /cm)  be  a  vector  denoting  a  PE 

771=1 

failure  with  k  failed  PEs,  where  km  >  0  is  the  number  of  failed  PEs  (whose  responsible 
LRUs  are  Lm ),  and 

M 

k=  J2  km-  (4) 

m= 1 

is  the  total  number  of  failed  PEs.  By  letting  ko  =  0,  the  probability  of  such  a  PE 
failure  can  be  written  as: 

777—1 

M  (k  —  E  km'\  , 

Prob(fci,fc2,-- ,kM  :  k)  =  Prob(/c)  m'= o  ){/3m)km.  (5) 

777=1  \  km  ) 

It  is  easy  to  prove  that  Equation  (5)  is  a  probability  mass  function  satisfying  the 
following  two  fundamental  conditions  in  Discrete  Probability  Theory: 


0  <  Prob(£q,  k-2,  ■  ■  ■  ,  /cm  :  k)  <  1, 


(6) 


and 


_  K 

1=  prob(/ci,/c2,---  ,kM  ■  k), 

,kM)e)C(k)  k= 0 


(7) 


M 

where /C(/c)  =  {(/cp /c2,  •••, /cAf)|/c  =  E  km,km  >  0,Vm}  and  k  =  0, 1,  •  •  •  ,K. 

777=1 


Consider  period  t,  let  x ^  be  the  number  of  failed  Lm  sent  to  the  depot  for  repair. 
The  zero-cost,  and  zero-time  assumption  for  all  in-base  operations  implies  that 
(where  0  <  <  glm  with  m  =  1,2,  ■  ,M)  are  the  only  decision  variables  for  repair 

operations  in  period  t.  Let  =  (k\,  k^,  ■  ■  ■  ,/cm)  be  the  PE  failure  occurring  in  period 
M 

t ,  where  kl  —  E  Thus,  in  the  base,  the  total  number  of  malfunctioning  Lm, 

777=1 

which  are  the  resources  of  Lm-repair  operations  in  period  t  +  1,  can  be  calculated  as: 


9m  9m  ^ 777  T  km- 


(8) 
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Since  repair  and  transshipment  operations  are  not  controlled  by  operating  managers, 
random  repair  lead  times  are  assumed.  Let  (tfm  be  the  lead  time  of  Lm  repair  in  period 
t.  (Note  that  lead  times  are  assumed  to  be  non-crossover,  i.e.,  t\  -\-dfy  <  t2Jrdt%l  if 
t\  <  t2-)  Let  =  {rlr  +  ^m  =  be  the  set  including  all  periods,  in  which  if  a  failed 
Lm  is  sent  to  the  depot  for  repair  it  will  be  repaired  and  returned  to  the  base  in 
period  t.  Therefore,  taking  into  account  all  functioning  LRUs  on  the  newly-failed  PEs 
in  period  t,  the  total  number  of  functioning  Lm  in  the  base  is: 

/m  =  /m  +  ^-*4+  Y  Xm ■  (9) 

TGz'A.m 


Using  the  effective-system  assumption,  the  number  of  recovered  PEs  would  be: 


n 


=  min  {n*  +  k\  ft,  ft,  ■■■  ,  ftM} 

=  mini  nl  +  kf ,  min  \  fL  +  k* 
L  171 


£  <}}• 

T<=Am 


(10) 

(11) 


As  assumed,  the  recovered  PEs  are  supposed  to  leave  the  system  by  the  end  of  period 
t.  Thus,  the  system  is  updated  to  (f*+1,g*+1,qt+1,n*+1),  where 


fm1  =  ftm-rf  =  ftm  +  kt~ktm+  Y.  Xm  ~  Vm  (12) 

9m  1  =9 m  =  9 m-  xm  +  klv  (13) 

Qm1  =  Qm+xm-  Y  xm,  (14) 

7it+1  =  nt  +  k*  —  h1 .  (15) 


2.4  Objective  and  constraints 

Since  all  in-based  operations  are  well-controlled  by  operating  managers,  this  report 
considers  out-base  costs  only:  LRU  transshipment  and  repair  costs.  Let  ct  be  the 
total  operating  cost  of  period  t,  which  is  the  sum  of  transportation  (fixed  and 
period-dependent)  and  repair  (variable  and  LRU-dependent)  costs.  Let  hl  be  the 
transportation  cost  of  period  t  if  there  is  some  malfunctioning  LRUs  sent  to  the  depot 
for  repair  and  let  rm  be  the  per-LRU  repair  cost  for  the  Lm-repair  operations  in  period 
t.  Thus,  the  total  operating  cost  in  period  t  can  be  written  as: 

M 

ct  =  htyt+  Y  vL,  (16) 

m=  1 

where  y 1  is  a  binary  decision  variable  such  as  yt  =  1  if  there  is  at  least  one  >  0  for 
some  m  G  {1,2,  •  •  •  ,M}  and  y*  =  0  if  xtm  =  0  for  all  m  =  1,2,  •  •  •  ,M. 
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Let  a  (where  a  >  0)  be  the  interest  gained  on  per  unit  unused  fund  for  each  period. 
Considering  T  periods,  at  time  one  the  present  value  of  total  operating  cost  can  be 
written  as: 


c  = 


T 


E 


(l  +  Q?)t  1 


(17) 


Consider  fleet  reliabilities,  which  can  be  evaluated  by  recovering  ratios.  Let  pf  be  the 
recovering  ratio  of  the  first  t  periods,  where  t  =  1,2,  •  •  •  ,T.  Thus, 


P 


t 


t+ 1 

E  nT 

T—  1 

t~\~  1 

n1  +  E  kT 

T=  1 


(18) 


Adding  a  positive  value  of  n1  as  part  of  the  denominator  is  to  avoid  a  possible  zero 
denominator,  i.e.,  pl  are  well-defined  for  all  t  —  1,2,  •  •  •  ,T. 


Let  B  (where  0  <  B  <  oo )  be  the  total  operating  budget.  The  problem  that  maximizes 
fleet  reliabilities  with  constraints  on  total  operating  costs  (denoted  by  MaxRho)  can 
be  formulated  as: 


max:  p  s.t.:  c  <  B  and  pl  >  p,  Wt  =  1,2,  •  •  •  ,T,  (19) 

where  at  the  end  of  each  time  period  systems  are  updated  using  the  calculations 
described  in  Equations  (12),  (13),  (14)  and  (15). 

In  order  to  solve  MaxRho,  a  complementary  problem  is  constructed  to  minimize  total 
operating  costs  with  constraints  on  fleet  reliabilities.  This  problem  is  denoted  by 
MinCost  and  it  can  be  written  as: 

min:  c  s.t.:  pf  >  A,  Vt  —  1,2,  •  ■  •  ,T,  (20) 

where  0  <  A  <  oo  is  the  designated  reliability  level.  Again,  at  the  end  of  each  time 
period  systems  are  updated  using  the  calculations  described  in  Equations  (12),  (13),  (14) 
and  (15). 

In  the  following,  MinCost  is  first  formulated  on  a  scenario  tree  and  solved  using  a 
dynamic  programming  algorithm.  Then,  MaxRho  is  solved  approximately  by  solving 
a  series  of  MinCost,  whose  designated  reliability  levels  are  dynamically  updated. 
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3  MinCost  and  approaches 


3.1  Scenario  tree  formulation 

Using  the  commonly-adopted  solution  strategy  (i.e.,  the  STB  approach),  multi-stage 
stochastic  optimization  (MSO)  problems  are  solved  in  two  steps:  (1)  A  MSO  problem 
is  represented  as  a  scenario  tree  model  (STM),  which  is  the  deterministic  equivalence 
of  the  MSO  problem;  (2)  A  dynamic  programming  algorithm  is  designed  to  search 
for  optimal  solutions  on  the  scenario  tree  and  the  found  solutions  are  actually  the 
solutions  to  the  MSO  problem.  Next,  a  formal  STM  formulation  for  MinCost,  denoted 
by  MinCostSTM,  is  presented. 

The  T-stage  MinCost  model  can  be  described  using  a  (T  +  l)-level  scenario  tree,  whose 

branches  are  constructed  based  on  PE  failures.  Let  T  be  the  tree.  Let  V  be  the  set  of 

T+l  K 

nodes  on  T  and  let  |V|  =  X)  m  be  the  number  of  nodes  in  V,  where  T  =  S  |/C(/c)| 

t= 0  k= 0 

is  the  number  of  distinct  PE  failures  during  each  time  period  (i.e.,  the  number  of 

branches  on  each  non-leaf  node,  whose  definition  will  be  given  later)  and  |/C(fc)|  is 
the  number  of  distinct  PE  failures  with  exactly  k  failed  PEs  (A;  =  0,1,---  ,  K).  In  this 
report,  |V|  is  used  to  denote  the  size  of  T,  which  also  is  the  size  of  MinCostSTM.  Note 
that  |V|  is  exponential  on  T  (which  is  the  number  of  time  periods  in  MinCostSTM). 

Recall  that  M  (where  M  >  0)  is  the  number  of  distinct  LRU  positions  on  each  PE.  It 
was  assumed  that  among  these  M  LRUs  there  is  one  and  only  one  responsible  LRU 
for  the  PE  failure  with  exactly  one  failed  PE.  Therefore,  by  Combinatorial  Number 
Theory  [30-32],  |/C(A)|  can  be  calculated  using  the  following  recursion  formula: 

M 

|/C(A)|  =  recPro(A,M)  =  y~]  recPro (k  —  M,M),  (21) 

M= l 

where  the  boundary  condition  is  recPro(0,M)  =  1.  Therefore,  F  can  be  re-written  as: 
K  K  KM 

r  =  £lKWI  =  £  recPro  (k,M)  =  E  E  recPro  (k  —  M,M).  (22) 

k= 0  k= 0  k=0M=l 

Consider  an  example  that  each  PE  is  made  up  of  two  distinct  LRUs  (i.e.,  M  —  2)  and 
the  maximum  of  failed  PEs  in  each  time  period  is  five  (i.e.,  K  =  5).  Following  the 
calculations  in  Equations  (21)  and  (22)  gives  T  =  1  +  2  +  3  +  4  +  5  +  6  =  21. 

Let  l (j )  be  the  level  of  node  j  G  V.  If  l(J)  —  T  +  1,  then  node  j  is  called  a  leaf  node. 
Reserving  index  1  for  the  root  node  gives  /(l)  =  1.  Let  a  (j )  be  the  direct  ancestor 
node  of  node  j  and  let  T> (j )  =  {i\j  =  a(i),Vi  G  V}  be  the  set  of  direct  descendant 
nodes  of  node  j.  In  particular,  a(l)  =  0  and  T> (j )  =  0  for  all  leaf  nodes  j.  Let  T (j )  be 
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the  subtree  rooted  on  node  j  and  let  V(j)  be  the  set  of  nodes  on  T{j).  This  gives 
alternatives:  T  =  T(  1)  and  V  =  V(l). 

Let  C(j,Z)  =  =  l(j)  +  l,Vi  G  V(j)}  be  the  set  of  nodes,  which  are  in  V(j)  but 

with  /  levels  higher  than  node  j.  In  particular,  C(j,  1)  =V(j).  There  is  no  doubt  that 
the  leftover  malfunctioning  LRUs  on  node  j  can  be  repaired  on  any  node  in  C(j,  1) 
(or  V(j)).  Let  C(l )  =  =  l,Vi  G  V}  be  the  set  of  nodes  on  T  with  level  l,  where 

l  —  1,2,  •  •  •  ,T  +  1.  If  failed  Lm  are  sent  to  the  depot  for  repair  on  node  j,  then  they 
will  be  repaired  and  arrive  on  all  nodes  in  V(j)  with  level  /(j)  +  d(ym),  i.e.,  on  all 
nodes  in  C(J,d^m)).  Note  that  C{j,d^m))  =  C{l(j)  +  d^m))nV(J). 

Considering  scenarios,  let  V (j )  be  the  path  from  node  1  to  node  j  including  nodes 
f  and  j.  If  j  is  a  leaf  node,  then  V (j )  indeed  denotes  a  scenario.  The  number  of 
scenarios  on  T  is  just  the  number  of  leaf  nodes  on  T  (which  can  be  calculated  by  YT). 
This  is  due  to  that  T  has  T  + 1  levels  and  each  non-leaf  node  has  exactly  Y  branches 
(assuming  IID-featured  PE  failures).  A  scenario  or  path  is  a  series  of  events  and  the 
probability  of  a  scenario  or  path  is  represented  by  the  probability  of  the  node,  with 
which  the  scenario  or  path  ends  up.  For  instance,  let  pj  be  the  probability  of  node 
j  G  V,  then  pj  is  the  probability  of  V {j ) .  All  reliability  and  cost  calculations  are  based 
on  either  paths  or  scenarios. 

To  illustrate  the  above  notations,  a  scenario  tree  example  is  given  in  Figure  5.  Node  i 
is  the  direct  ancestor  of  node  j,  i.e.,  i  =  a(j).  Nodes  j^,  jk2  and  jk3  are  the  direct 
descendant  nodes  of  node  j,  i.e.,  V(j)  =  {jk  1,jk2,jk3}-  In  the  figure,  T(jk2)  is  a 
subtree  rooted  on  node  jk2  and  all  nodes  in  T(jk2)  form  V(jk2)-  Path  V(j),  which 
includes  node  j  and  all  ancestor  nodes  of  j,  is  included  as  a  common  subpath  by  the 
nine  scenarios  presented  in  the  figure.  Obviously,  this  figure  does  not  depict  all  nodes, 
branches  or  scenarios.  As  can  be  seen,  node  jh  is  three  levels  higher  than  node  j.  This 
gives  C(j, 3)  =  V(j)  n£(Z(j)  +  3)  =  V(j)n£(l(jh))  =  {jh,  jhx,Jh2,  ■  ’  ■  Jh8}- 

Considering  probabilities,  since  node  1  denotes  the  initial  status,  which  is  well-defined, 
the  probability  is  p\  —  1.  Since  all  nodes  in  level  2  are  branched  from  node  1, 

Z)  Pj=pi  =  l.  (23) 

j€T>(l)  (or  je£{2)  or  jeC{l,l)) 

More  generally,  X)  Pi  =Pj  is  true  for  all  non-leaf  node  j.  Therefore,  the  sum  of 

ieV(j) 

probabilities  for  all  nodes  on  the  same  level  is  one,  i.e.,  Pj  —  U  and  the  sum 

of  probabilities  for  all  scenarios  is  one,  i.e.,  Pj  —  L  Next,  MinCostSTM  is 

j-.l(j)=T+ 1 

formulated  on  a  (T  +  l)-level  scenario  tree,  which  is  similar  to  Figure  5. 

In  MinCostSTM,  instead  of  using  the  notations  introduced  in  Sections  2.2,  2.3  and  2.4, 
the  following  notations  are  used  to  denote  the  initial  system  on  node  one,  i.e.,  n\  =  n1, 
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Figures:  Scenario  tree  notations:  V(j),  T(jk2),  V(jfc2)>  V(j),  C(j, 3)  and  C(l(jh)). 

fi  =  f1,  gx  =  g1  and  =  q1.  Without  loss  of  generality,  qx  =  (0,0,  ••  •  ,0)  is  assumed. 
Generally,  let  D?  =  •  •  •  ,-D(j,M))  be  the  cumulative  LRU  demands  by 

node  j,  Vj  E  V;  that  is  when  D(j  for  all  m  =  1,2,  •  ■  ■  ,M  are  satisfied,  the  fleet 
reliability  constraint  on  node  j  is  satisfied,  i.e. ,  pj  >  A,  where  A  is  the  required  level 
for  fleet  reliabilities.  For  simplicity,  "cumulative  demand"  is  replaced  with  "demand" 
in  the  remaining  part  of  the  report. 

In  T,  PE  failures  are  denoted  by  branches.  For  instance,  in  Figure  5,  the  branch 
connecting  node  j  and  denotes  the  PE  failures  occurring  in  time  period  [l(j),l(jk i))- 
Since  no  branch  heads  into  node  1,  Di  is  undefined  and  simply  set  as  Di  =  (0,0,  •  •  •  ,0). 
In  order  to  determine  LRU  demands,  consider  a  scenario  V(j).  For  any  node  j  E  V{j) 
with  j  >  1,  let  D j  =  ■  ■  ,  D^  M\)  be  the  fundamental  demand  (F-demand) 

by  node  j.  Let  kj  =  ■  •  •  ,  k be  the  PE  failures  denoted  by  the  branches 

heading  into  node  i  for  all  i  E  V(j)  \{1},  where  k is  the  number  of  failed  PEs  due 
to  malfunctioning  Lm.  In  order  to  satisfy  the  fleet  reliability  constraints,  the  total 
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number  of  recovered  PEs  by  node  j  is  at  least: 

n(j)=\Axn(j)],  (24) 

where 

M 

n(j)  =  m  +  51  5Z  (25) 

ieV(j)\{l}m=l 

is  the  total  number  of  failed  PEs  by  node  j.  Thus,  the  F-demand  of  Lm  by  node  j  is: 

D(j,m)  =max{0,n(j)-/(lim)-  51  51  (26) 

*€P(j)\{l}  ra'=l,--  ,m— l,m+l,—  ,M 

For  any  given  Lm,  let  if where  h  —  and  H  <  T  +  1,  be  the  nodes  on  V{j) 

such  that  DuhjTn\  >  D^m\  for  all  i  G  V(a(ih))  and  1  <  l(i\)  <  l (12)  <  ■  ■  ■  <  1(ih )  <  T  +  l. 
Thus,  for  any  node  j  =  or  j  G  V(ih+ 1)  \  (' P(ih )  U{*/i+l});  the  demand  of  Lm  by 
node  j  can  be  determined  as:  (letting  iq  =  1) 

=  (27) 

This  revised  demand  definition  considers  the  truth  that  it  is  meaningless  for  any 
cumulative  demand  which  is  smaller  than  some  demands  occurring  in  earlier  stages. 
The  existence  of  such  demand  structure  is  due  to  the  fact  that  functioning  parts  on 
some  failed  PEs  could  be  used  to  recover  some  other  failed  PEs.  Such  self-recovery 
processes  might  reduce  the  actual  demand  on  some  stages. 

Consider  the  case  that  in  7 -  j  for  some  Lm,  i.e.,  the  Lm  demand  on  V(j)  reaches  its 
maximum  before  the  leaf  node  j.  With  respect  to  Lm  demands,  V (j )  can  be  replaced 
with  V(ih)-  Part  of  V(j),  i.e.,  from  the  direct  descendant  node  of  ijj  to  node  j,  are 
vanished  with  respect  to  Lm  demands.  By  conducting  the  same  Lm-demand  analysis 
on  all  other  scenarios,  there  might  be  more  vanished  branches  and  nodes.  The  scenario 
tree  T  is  reduced  to  the  Lm-dernand  layer,  denoted  by  Tm-  Apparently,  Tm  QT-  A 
formal  description  for  how  to  construct  demand  layers  and  how  to  use  this  concept  in 
developing  and  evaluating  searching  algorithms  will  be  discussed  later. 

For  a  given  node  j  G  V  with  l(j)  <  T  +  1,  let  x?-  =  {x(ym.)  Ix(ym)  =  x^,  where  x ^  G 
, rn  =  1,2,---  ,M  and  l(j)  =  t}  be  the  notation  for  LRU-repair  decisions  on  node 
j.  Let  yj  =  {0,1}  be  the  binary  variable  such  that  yj  =  0  (if  £(ym)  =  0  for  all 
rn  —  1,2,---  ,M)  and  yj  =  1  (if  £(ym)  >  0  for  some  m  G  (1,2,- ••  , M}).  As  known, 
x(j,m)  is  upPer  bounded  by  the  number  of  malfunctioning  Lm.  In  particular,  for  node 
1  the  upper  bound  is  given  as  x^i  m^  <  g(ijTny  For  all  other  nodes  j  G  V\{1}  with 
l(j)  <  T  +  1,  the  upper  bounds  are: 

x(j,m)  —  9(1, m)  T  5  (  ( k(i,m )  x(a(i),m)'j  ■  (28) 

ieP(j)\{  l) 
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To  satisfy  the  fleet  reliability  requirement,  the  Lm  demand  by  node  j  should  be 
satisfied  as: 

ieV(a(j)):jeV(h)  where  h&C(i,d^irn)) 

Replacing  hL  with  hj  for  l(j)  =  t  gives  the  formal  presentation  of  MinCostSTM: 

(  M  \ 


hjVj  +  Z  rmX(j,m) 

m=  1 


V — ■v  \  771=1  / 

min:  >  pj— - - - , - — 

jeV:l(j)<T+ 1  (l  +  «)  J 

(30) 

S.t..  ^ 

i€V(j):j€V(h)  where  hsC{i,d^m)) 

Vj,m 

(31) 

x(j,m)  —  9(1, m)  +  y  )  (k(i,m)  ^( a(i),m ))  j 

i€V(j)\{  1} 

Vj,m 

(32) 

x(j,m)  —  Vj^i  ^jim 

(33) 

X(j,m)  are  non-negative  integers,  Vj,m 

(34) 

yj  are  binary  variables,  Vj, 

(35) 

where  the  objective  (30)  is  to  minimize  the  total  expected  operating  cost.  The  con¬ 
straints  expressed  in  (31)  describe  the  fleet  reliability  requirements  and  constraints  (32) 
show  that  the  number  of  repair  operations  are  upper  bounded  by  the  number  of  failed 
LRUs.  Constraints  (33)  indicate  that  positive  values  are  available  only  when 

positive  ijj  values  are  assigned  and  constraints  (34)  and  (35)  demonstrate  non-negative 
X(j^rn}  and  binary  ijj ,  respectively.  Note  that  Z  is  a  constant  with  a  large  positive 
value. 


3.2  Polynomial  algorithm 

In  production  planning,  research  on  minimizing  the  total  production  and  inventory  cost 
has  a  long  history.  Generally,  in  order  to  satisfy  demands,  which  usually  dynamically 
arrive  during  a  finite  number  of  time  periods,  items  are  manufactured/ordered  with 
a  certain  lead  time,  under  a  fixed  operating  cost  and  a  variable  cost  on  each  item. 
In  addition,  over-manufactured/ordered  items  and  un-satished  demands  might  be 
penalized  for  overstocking  and  backorder  costs,  respectively. 

For  deterministic  production  planning  problems  in  which  demands  in  later  time  periods 
are  also  known  before  or  at  time  one,  Wagner  and  Whitin  [33]  proved  that  if  the 
problem  had  zero  initial  inventories,  unlimited  ordering  capacities  and  zero  lead  times, 
then  there  existed  an  optimal  production  schedule  such  that  the  accumulated  order 
quantity  by  period  t  was  exactly  the  accumulated  demand  by  some  period  r,  where 
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0  <  t  <  t  <  oo.  This  was  called  the  Wagner- Whitin  property.  However,  with  stochastic 
features,  i.e.,  later  demands  wouldn’t  be  available  until  the  time  comes,  Ahmed  [34] 
showed  that  the  Wagner- Whitin  property  didn’t  hold  if  lead  times  were  non-zero. 
Halman  et  al.  [35]  showed  that  this  stochastic  version  was  NP-hard  with  respect  to 
the  number  of  time  periods,  even  though  there  were  no  ordering  costs  and  only  two 
demand  events  for  each  time  period.  For  the  same  stochastic  problem,  Guan  and 
Miller  [36]  proved  the  Production-Path  property,  which  is  actually  a  modified  version 
of  Wagner- Whitin  property  to  compromise  the  stochastic  feature  of  the  problem.  That 
is,  there  existed  an  optimal  production  schedule  such  that  the  accumulated  order 
quantity  by  period  t  was  exactly  one  of  the  accumulated  demand  combinations  by 
some  period  r,  where  0  <  t  <  r  <  oo.  Using  this  property,  they  were  able  to  solve  the 
scenario  tree  version  of  the  problem  in  polynomial  time  with  respect  to  the  size  of 
the  scenario  tree.  Huang  and  Kucukyavuz  [37]  further  extended  the  Production-Path 
property  to  more  general  problems,  in  which  all  lead  times,  ordering,  purchasing  and 
inventory  costs  were  random.  However,  the  production  planning  problems  addressed  in 
these  papers  considered  only  one  type  of  item  and  the  production/ordering  capacities 
were  unlimited  over  the  whole  decision  horizon. 

In  MinCostSTM,  each  PE  includes  a  number  of  LRUs  and  a  failed  PE  cannot  get- 
back  to  work  until  all  of  the  installed  LRUs  are  functioning  well,  i.e.,  a  PE  demand 
cannot  be  satisfied  until  all  the  associated  LRU  demands  are  satisfied.  Moreover,  in 
MinCostSTM  each  newly-failed  PE  brings  one  malfunctioning  LRU  (which  can  be 
seen  as  an  LRU  demand)  and  M  —  f  functioning  LRUs  (which  can  be  cannibalized 
as  functioning  LRUs)  into  the  system.  MinCostSTM  has  correlated  LRU  demands 
and  supplies.  On  the  other  hand,  in  each  time  period  LRU-repair  operations  are 
upper-bounded  by  the  number  of  malfunctioning  LRUs  at  the  beginning  of  the  period. 
Since  malfunctioning  LRUs  are  introduced  dynamically  by  the  newly-failed  PE  failures, 
LRU-repair  operations  are  constrained  dynamically.  MinCostSTM  is  a  multi-item 
production  planning  problem  with  dynamically-varied  production  capacities. 

In  the  extended  version  of  Production-Path  property  (i.e.,  Proposition  5  in  [37]),  for 
any  given  node  j  if  it  has  a  positive  ordering  quantity,  then  the  total  ordering  quantity 
on  V (j )  is  equal  to  the  total  demand  on  V(i),  where  node  i  is  a  node  located  in  one 
of  the  subtrees  rooted  on  the  nodes  in  the  intersection  set  of  V (j )  and  C(l(j)  +  dj ). 
Note  that  dj  is  the  lead  time  for  the  production  decisions  made  on  node  j.  Since 
this  proposition  considers  single-item  products  and  unlimited  production  capacities, 
in  order  to  coordinate  MinCostSTM,  where  each  PE  has  M  LRUs  and  LRU  repair 
operations  are  dynamically  upper  bounded  by  the  number  of  malfunctioning  LRUs, 
necessary  modifications  on  the  property  are  required. 

Recall  that  if  a  malfunctioning  Lm  is  sent  to  the  depot  for  repair  on  node  j,  then  it 
will  be  repaired  and  returned  to  the  base  on  all  nodes  in  C(j,  d(jm)),  i.e.,  the  nodes  in 
V (j )  with  level  l(j)  +  d(j^my  This  implies  that  the  L^-repair  operations  on  node  j 
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can  be  used  to  satisfy  the  Lm  demands  on  any  node  h  G  V(i),  where  i  G  C(j, dgmj). 
Due  to  the  non-crossover  and  feasible-solution  assumption,  the  Lm-repair  operations 
on  nodes  in  V(a(j))  should  be  large  enough  to  satisfy  all  of  Lm  demands  on  nodes 
in  U i£C(j,d(j  m))  (’^’(a(*))  \^(a(i)))-  Next,  Proposition  1  consolidates  and  modifies  the 
propositions  in  [33, 36, 37]  to  reflect  the  discussions  above. 

Proposition  1.  For  MinCostSTM,  there  exists  an  optimal  solution  tt  =  {(yj,Xj)\Wj  G 
V}  such  that  if  x(j.m)  >  0  for  some  j  G  V  and  m  G  {1,2,---  ,M},  then  (I)  yj  =  1; 
(II)  satisfies  the  repair  capacity  constraint  (32) ;  and  (III)  the  fleet  reliability 

constraint  (31)  is  addressed  by: 

D(h,m)  (36) 

j'£A{i,m)r\V(j) 

where  i  G  C(j,d(j,m))  and  /iGV(i).  Note  that  Xj  =  (xgipxg^r"  ,x(j,M))- 

Proof.  Similarly  to  the  proof  of  Proposition  5  in  [37] ,  this  proposition  is  proved  by 
contradiction.  Let  node  j  be  some  node  in  V  such  that  Equation  (36)  is  violated. 
This  means  that  for  all  nodes  i  G  C(jjdgm^),  where  m  G  {m'\xgm^  >  0  and  ml  = 
1,2,  ,  M},  there  does  not  exist  any  node  h  such  that  h  G  V(i)  and  Equation  (36) 

is  satisfied.  In  the  following  proof,  by  re-allocating  all  or  a  portion  of  £(ym),  the 
updated  xg^m)  (denoted  by  x'g  rg )  will  be  zero  or  satisfy  Equation  (36).  After  the 
re-allocation,  it  is  observed  that  the  fleet  reliability  constraints  are  satisfied  with  in 
most  cases  smaller  operating  costs.  (Note  that  the  operating  cost  won’t  be  increased.) 
This  contradicts  the  original  statement  that  (y.;.x,;)  G  n  is  an  optimal  solution  to 
MinCostSTM. 

A  positive  xg,m)  implies  l (j )  +  dg^  <  T  + 1.  If  this  is  not  the  case,  then  letting 
x(j,m)  =  0  won’t  cause  any  problem  other  than  reducing  operating  costs.  For  the  case  of 
l(j)  +d(j  rn-j  <  T  + 1,  consider  a  leaf  node  i  G  C(j,  T  +  l  —  l(j)).  If  x^m^  =  0  for  all  nodes 
h  G  V(a(i ))  \  V(j),  then  i  is  called  a  mLeaf  node  of  node  j.  The  Lm  demand  on  i  is 
satisfied  by  Lm-repair  decisions  on  V(j).  If  >  0  for  some  node  h  G  V(a(i))  \V(j), 

then  let  the  node  with  the  smallest  1(h)  such  that  1(h)  +  >  l(j)  +  (fj.m)  and 

1(h)  =  l(j)  +  be  the  mGRoot  and  mERoot  nodes  of  node  j,  respectively. 

Note  that  a  mGRoot  (or  mERoot)  node  might  be  shared  by  more  than  one  node. 
Let  B(j,m)  and  B(j,m)  be  the  sets  including  all  mGRoot  and  mERoot  nodes  in  Vj, 
respectively.  Let  A'(j,m)  be  the  set  including  all  mLeaf  nodes  in  Vj.  Note  that  there 
is  at  least  one  non-empty  set  among  B(j,m ),  B(j,m)  and  A'(j,m). 

To  illustrate  these  mERoot ,  mGRoot  and  mLeaf  nodes,  Figure  6  depicts  an  five-level 
example  subtree  V(j).  Let  dg  =  3  be  the  Lm-repair  lead  time  on  node  j  and  2  be 
the  Lm-repair  lead  times  on  all  other  nodes  in  V(j)  \  {j } .  Let  nodes  j,  hi  and  h-2  be 
the  only  nodes,  where  positive  Lm-repair  decisions  are  made;  that  is  the  Lm-repair 
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Figure  6:  The  mERoot,  mGRoot  and  mLeaf  nodes  of  node  j  in  the  sample  problem. 

decisions  made  on  nodes  j ,  h\  and  /12  will  arrive  at  the  shaded,  bold-line-circled  and 
dash-line-circled  nodes,  respectively.  It  is  easy  to  observe  that  nodes  i\,  12, 13  and  24  are 
the  mLeaf  nodes,  and  nodes  hi  and  ti2  are  the  mERoot  and  mGRoot  nodes,  respectively. 
Thus,  in  Figure  6,  B(j,m )  =  {hi},  B(j,m )  =  {^2}  and  A'(j,m)  =  {«i , *2, ^3, *4}- 

Note  that  =  0  and  A\j,m)  =  0  gives  C(j,d^m))  =  U hGB(j,m)C(h^d(h,m))-  Since 

Pj  =  ph,  allocating  x^jm^  to  all  mERoot  nodes  such  as  x'^  m)  =  0  and  x^h  m ^  = 

h€B(j,m) 

X(h,m)  +  x(j,m)  for  all  h  £  B(j,m )  might  reduce  the  total  operating  cost  by  hj  (if  x^j  m) 
is  the  only  positive  LRU-repair  decision  on  node  j).  This  zero- valued  x'^-  satishes 
the  proposition  statement  but  contradicts  with  the  optimality  claim  for  X(j  mj  >  0. 

If  one  or  none  of  B(j,m )  and  A\j,m )  is  empty,  let  A(j,m )  =  A'(j,m)  U  {a(i) \i  £ 
C(h,d^m^),\/h  £  B(j,m)}.  Let  be  the  largest  Lm  demand  over  all  nodes  in 

A(j,m )  and  occurs  on  node  h.  Note  that  is  also  the  largest  Lm  demand 

over  all  nodes,  which  have  to  be  satisfied  using  the  Lm-repair  decisions  on  V(j).  Using 
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the  feasible-solution  assumption,  X(j^m)  >  my  In  order  to  satisfy  Equation  (36),  let 
\j,m)  =  X(j,m)-D(h,m)  be  re-allocated  amount  such  that  x'{j  m)  =  x^m)  -  A(^m)  and 
x'(h  m)  =  X(h,m)  +  A(h  rn)  for  a11  h  e  m )■  Since  Pj  >  E  Ph ,  the  total  operating 

cost  is  reduced  at  least  (i.e.,  considering  a  =  0)  by  a  value  of  (pj  —  Ph)^(j,m)rm- 

heB(j,m) 

This  updated  aA  m)  >  0  covers  the  Lm  demand  on  node  h  but  contradicts  with  the 
optimality  claim  for  X(j.rn)  >  0. 

Note  that  the  feasibility  of  re-allocating  and  A(^m)  to  the  mERoot  nodes  in 

B(j,m)  is  due  to  the  fact  that  the  un-used  Lm- repair  capacities  (i.e.,  the  un-repaired 
malfunctioning  Lm )  can  be  transferred  to  all  nodes  h  G  B(j,m )  via  V(a(h))\V(j),  if 
there  is  no  additional  Lm-repair  operation  on  V(a(h))\V(j)  .  □ 


Proposition  1  indicates  that  a  dynamic  programming  algorithm,  which  tries  possible 
LRU  repair  operations  on  all  non-leaf  nodes  in  V,  could  be  developed  and  used  to 
determine  optimal  repair  decisions  for  MinCostSTM.  Before  this,  necessary  notations 
are  introduced.  For  node  j  G  V,  let  Vj (I) ,  where  I  =  (h,i2,m"  Um),  be  the  optimal 
solution  value  of  the  MinCostSTM  problem  defined  on  V(j),  where  -D(jm.m)  are  satished 
by  the  Lm-repair  decisions  made  on  V(a(j))  for  all  nodes  im  G  I.  Note  that  it  is 
possible  to  have  imi  =/=  im2  and  imi  =  im2 ,  where  im2,im,2  G  I  and  m\  ^  m2 ■  As  two 
extreme  cases,  i  1  =  12  =  •  •  •  =  %m  means  that  I  denotes  a  single  node  in  V  and  imi  ^  im2 
for  all  1  <  mi  ^  m2  <  M  means  that  I  denotes  M  distinct  nodes  in  V. 


Let  Uj( I,  J) ,  where  1  =  (ii,*2,-”  ,*m)  and  J  =  (ji,j2,-"  ,Jm),  be  the  objective  value 
for  the  MinCostSTM  problem  defined  on  V (j )  with  either  ^  —D^m  rry 

or  X(j^n)  —  0  for  all  m  —  1,2,  •  •  •  ,M  such  that  D(ymTry  are  satisfied  by  the  Lm-repair 
decisions  made  on  V(a(j))  for  all  im  G  I.  Considering  making  11011-zero  Lm-repair 
decisions  on  node  j,  the  upper  bound  for  such  decisions  can  be  calculated  as: 

9(j,m)  =  9(1, m)  +  ^2  (k(j',m)  ~  X(a(j'),m)]  >  (37) 

i.e.,  =  D(jm,m)  —  D(im,m)  if  0  <  D^jm  Tn^  —  (Note  that  X(j,m)  <  0 

is  not  feasible.)  On  the  other  hand,  £(ym)  =  0  is  assigned  when  Du  m^  =  Dyrn  rry  or 
D(im,m)  >  9(j,m) •  Thus,  the  value  of  Uj( I,  J)  can  be  determined  as: 


hj  +  E  x(j,m')rm' 


Uj{  I,  J)  —  ( 


+  E  MI),  if  for  some  m! 

?£C(j,  1) 


Pj  (l+a)l(P  1 

^  <  ^Umh711')  ~  ^(V-m 0  —  9{j,m') 

E  vf(I),  if  for  all  m  =  1,2,  •  •  •  ,  Af,  if 
1) 


(38) 


or  D/a 


(. jm,m )  D(im,m)  >  9 (j,m) 
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For  a  given  Lm  and  a  node  j.  let  W(j,m)  =  {i\l(j)  +  d^tm)  <  l{i)  <  Kf)  V/  G 

C(j,  1)  and  i  G  V(j7)}  be  the  set  including  all  nodes  whose  Lm  demands  have  to  be 
satisfied  by  the  Lm-repair  decisions  on  V (j ) .  Using  the  feasible-solution  assumption, 
for  any  set  of  Lm-repair  decisions  on  V(a(j))  the  remaining  Lm-repair  resources  on 
node  j  would  be  enough  to  make  the  Lm-repair  decisions  on  node  j  to  satisfy  the  Lm 
demands  on  all  nodes  i  G  W(j,  m)  for  all  m  —  1, 2,  ■  ■  ■  ,  M .  Thus,  the  value  of  Vj  (I)  can 
be  calculated  as: 


»(I)  = 


mm 


jm€{j'\j'eV{j)  and  l{j')>l(j)+d^,m)}-D^jrl 


{imS 


,)>o 


K(I,J)}-  (39) 


Let  0  be  the  dummy  notation  for  the  initial  status,  i.e. ,  0  represents  {npfpg^q^s}. 
Let  ui(0)  be  the  optimal  solution  value  of  MinCostSTM.  The  backwards  dynamic 
programming  algorithm,  denoted  by  Recursion,  starts  from  rq(0).  For  boundary 
conditions,  if  l(j )  +  d(j,m)  >  T  +  1  (where  node  j  G  V),  then  the  optimal  Lm-repair 
decision  is  X(jiTn)  =  0-  In  turn,  if  l(jm )  +  >  T  +  1  occurs  for  all  jm  G  J,  then 

Uj( I,  J)  =  0  is  determined  for  all  feasible  I.  Next,  Recursion  is  summarized. 

Algorithm  Recursion 

[Boundary  Conditions]:  For  any  pair  of  I  and  J,  if  l(jm )  +  d^m ]Tn)  >  T  +  1  for  all 
jm  G  J  and  m  =  1, 2,  •  •  •  ,  M,  then  set  Uj (I,  J)  =  0. 

[Recursion  Procedure]:  For  each  Uj( I.J),  do  the  calculations  in  Equation  (38): 


J)  —  < 


h.j+  Jj,  x(j ,m')rm'  I 

Vj  1-1 - -+  E  Vj'  (I),  if  there  is  some 

i'ecy,  l) 


(1 +a)*b)-i 

m  ^  1 5  2,  -  -  -  ,M }  with  0  <  D(jrn, —  9(j,m') 
Y  Vj/( I),  if  for  all  m  =  1,2,---  ,  M,  it  is  occurred  such  that 


j'eCU,  l) 


either  D^m  m^  D^jm  rn^  or  D^jm  m^  >  9(j,m) 

For  any  j1  G  C(j,  1),  do  the  following  calculations  as  described  in  Equation  (39): 

iq/(I)  =  min  {m»/(II,)| 

*Le{.?"|i"eV{j')  and  di")>di,)+rf(/,m)}:-D(/m,m)~-D(zm,m)>0 

where  I'  =  (i'l,^, ■  ■  •  ,i'M)- 

[Optimal  Solution]:  Recall  that  the  LRU  demands  on  node  1  is  Do  =  (0,0, •••  ,0). 

ui(0)=  min  {ui(0,J)|.  (40) 

jrn€{j'\j'€V  and  l{j')>l+d{l^m)}:DUrn  rn)>Q 
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Theorem  1.  Recursion  finds  an  optimal  solution  to  MinCostSTM  in  0(\V\'2M+2)  time, 
T+ 1 

where  |V|  =  r*  is  the  total  number  of  nodes  on  the  scenario  tree  presentation  of 
t=o 

MinCostSTM.  Note  that  for  any  given  K  and  M  in  MinCostSTM,  T  can  be  determined 
using  the  procedures  described  in  Equations  (21)  and  (22). 


Proof.  The  correctness  follows  Proposition  1  and  the  discussions  above.  Regarding  the 
complexity,  the  total  number  of  Uj(I,  J)  is  upper  bounded  by  the  production  of  the 
number  of  j,  I  and  J,  whose  maximum  values  are  |V|,  \V\M  and  \V\M ,  respectively. 
Since  the  calculation  of  each  Uj(I,3)  requires  to  evaluate  exactly  \C(j,  1)|  number  of 
iy(I),  i.e. ,  one  evaluation  for  each  node  j1  G  C(j,  1),  the  total  calculations  require 
0(| V|  x  \V\M  x  \V\M  x  \C(j,  1)|)  <  0(|V|2M+2)  computing  times,  where  \C(j,  1)|  <  |V| 
is  the  number  of  nodes  in  C(j,  1).  Since  Uj( I.J)  computing  dominates  Recursion,  the 
overall  run  time  is  0(|V|2M+2).  □ 


3.3  Run  time  analysis 


In  MinCostSTM,  a  PE-failure  (say  k ?  =  -  ■  ■  , k^M)))  introduces  into  the 


maintenance  system  k^jm^  and 


M 

m'= !,•••  ,m— ,M 


number  of  malfunctioning 


and  functioning  Lm,  respectively.  There  is  no  doubt  that  these  functioning  Lm  can 
be  used  to  recover  failed  PEs  via  cannibalization.  As  such,  the  Lm  demands  can  be 
reduced  without  using  stocked  Lm.  This  Lm-demand  reduction  might  cause  irregular 
demands  on  Lm,  e.g.,  D{j,m)  <  D(a(j),m)  (for  Lm  F-demands)  and  T>(ym)  =  D 
(for  Lm  demands)  for  some  node  j  G  V. 


Consider  a  leaf  node  j  G  V,  where  D(jjm)  =  D[a{j),m)  f°r  some  m-  Since  D(j;m)  is 
satisfied  once  is  satisfied,  eliminating  node  j  from  V  makes  no  difference 

with  respect  to  Lm-repair  decisions  or  Lm-demand  sanctifications.  If  leaf  node  j  is 
the  only  descendant  node  of  node  a(j),  then  after  node  j  is  eliminated  node  a  (j ) 
become  another  leaf  node.  Another  round  of  elimination  might  be  applied  on  a  (  j ) .  By 
repeatedly  doing  all  such  eliminations  from  the  node  with  the  highest  level  on  each 
scenario/path,  the  Lm-demand  layer  Tm  is  obtained.  Let  Vm  denote  the  set  of  nodes 
on  Tm.  By  merging  all  Tm,  a  residual  tree  (denoted  by  Tr  =  U^=lTm)  is  obtained. 
Clearly,  Tr^T.  Similarly,  letting  Vr  include  all  nodes  on  Tr  gives  Vr  =  U^f=1Vm. 
Clearly,  VflC  V  and  | Vr|  <  |V|.  Similarly,  let  Tm{j )  be  the  Lm-demand  layer  of 
subtree  T (j )  and  let  Tr(J)  be  the  residual  tree  of  subtree  T (j ) • 


Using  these  demand  layers  and  residual  tree,  the  calculation  of  optimal  solution  value 
of  the  7R(j)-based  MinCostSTM  problem,  i.e.,  Equation  (39),  should  be  modified  as: 

m(I)  =  min  {n7(I.  J)}.  (41) 
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It  might  happen  for  some  node  j  and  Lm,  Vrn (j )  =  0.  In  this  case,  jm  =  im  is  assigned 
and  therefore  X{j.m)  —  0-  Accordingly,  the  vy  (I)  and  ui(0)  calculations  in  Recursion 
are  modified  to  search  for  nodes  i'm  and  jm  on  Vm{j')  and  Vm,  respectively.  The  latter 
one  requires  the  following  modifications  on  Equation  (40): 

ui(0)=  min  {wi(0,J)}.  (42) 

jm€{j'\j'£Vm  and 

Similarly,  the  calculations  of  Uj( I.  J)  in  Equation  (38)  should  be  modified  as: 


Mi(I>  J)  —  ( 


h3  +  £  x(j,m')rm' 

Pj~ - 7ra'e{(i+aVa)-i - -+  E  ^/(I),  if  for  some  m! 

U+a)  j'eCjtV,  l) 

^  <  ~  %.,.«*')  — 

E  ^'(J),  if  for  all  m  —  1,2,---  ,M,  if  =  Dym,m) 

i'GCflO',!) 


or  D(im,m)  >  9{j,m) 

where  Cr(j,  1)  is  the  set  of  descendant  nodes  of  j  on  Tr  and  Cr(j,  1)  C  C(j,  1). 


To  illustrate  how  the  idea  of  demand  layers  and  residual  tree  reduces  run  times,  a 
four-period  sample  problem  is  used.  In  the  problem,  all  PEs  are  made  up  of  two 
different  LRUs,  LRU-1  and  LRU-2.  Initially,  there  are  no  spare  or  due-in  LRUs  for 
either  LRU-1  or  LRU-2  (i.e.,  si  =  S2  =  qi  =  =  0),  and  there  are  two  failed  PEs  with 

one  malfunctioning  LRU-1  and  two  malfunctioning  LRU-2  as  in  Table  1. 


Table  1:  The  basic  information  of  the  sample  problem. 


Items 

Number  of  Failed/Malfunctioning  Items 

Initial  Status 

Type-1  PE-failure 

Type-2  PE-failure 

PE 

2 

1 

1 

LRU-1 

1 

1 

0 

LRU-2 

2 

0 

1 

As  showed  in  the  table,  there  are  two  types  of  PE-failures:  i.e.,  Type-1  PE  failure  has 
one  failed  PE  with  a  malfunctioning  LRU-1  on  it,  while  Type-2  PE  failure  has  one 
failed  PE  with  a  malfunctioning  LRU-2  on  it.  Note  that  since  this  problem  is  used  to 
compare  scenario  tree,  residual  tree  and  demand  layers,  unrelated  information  (e.g., 
the  operating  costs,  lead  times,  and  PE-failure  probabilities)  are  not  presented. 

Let  A  =  0.5  be  the  designated  level  for  fleet  reliabilities.  Since  in  each  time  period  PE 
failures  introduce  exactly  one  failed  PE,  the  number  of  PEs  (which  are  required  to  be 
recovered  at  the  end  of  each  time  period)  are  two  (by  time  2),  two  (by  time  3),  three 
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(by  time  4)  and  three  (by  time  5).  As  such,  the  demands  for  LRU-1  and  LRU-2  can 
be  calculated  easily,  which  are  presented  on  the  scenario  trees  depicted  by  Figure  7 
(a)  and  (b),  respectively. 

Note  that  both  trees  share  the  same  tree  structure,  which  is  the  tree  structure  of 
the  scenario  tree  presentation  of  the  sample  problem.  On  each  non-leaf  node,  the 
upper  and  lower  branches  represent  Type-1  and  Type-2  PE  failures,  respectively.  It  is 
observed  that  on  almost  all  nodes  the  demands  for  LRU-1  are  less  than  the  demands 
for  LRU-2.  Apparently,  this  is  due  to  the  initial  status  that  the  two  failed  PEs  have 
one  functioning  LRU-1  and  zero  functioning  LRU-2. 


Figure  7:  The  LRU  demands  of  the  sample  problem. 


Following  the  above  node  elimination  procedures,  the  LRU-1  and  LRU-2  demand 
trees  in  Figure  7  (a)  and  (b)  are  reduced  to  the  LRU-1  and  LRU-2  demand  layers  in 
Figure  8  (a)  and  (b),  respectively.  It  is  observed  that  Figure  8  (c)  is  the  residue  tree, 
which  is  the  combination  of  the  two  demand  layers:  Figure  8  (a)  and  (b). 

In  Figure  8  (c),  the  nodes,  which  are  depicted  on  both  demand  layers  (Figure  8  (a) 
and  (b)),  are  shaded.  The  zero  value  marked  on  the  root  node  indicates  zero  demands 
for  both  LRUs.  For  the  other  two  shaded  nodes,  since  the  demands  for  LRU-1  and 
LRU-2  are  different,  no  demand  value  is  marked.  The  solid-line  and  dash-line  circled 
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^bVl^Irkftt-fttiF-^hiands  (b)  ^Pftl&eajal  tree 


Figure  8:  The  demand  layers  and  residual  tree  of  the  sample  problem. 


nodes  are  the  nodes  depicted  on  LRU-1  and  LRU-2  demand  layers  (Figure  8  (a)  and 
(b)),  respectively.  The  marked  values  in  these  non-shaded  nodes  are  the  associated 
demands  for  LRU-1  and  LRU-2. 

It  is  observed  that  the  number  of  nodes  are  reduced  from  31  (the  original  scenario  tree 
in  Figure  7  (a)  and  (b))  to  7  (the  LRU  demand  layers  in  Figure  8  (a)  and  (b))  and  11 
(the  residual  tree  in  Figure  8  (c)).  Using  the  original  scenario  tree  (Figure  7  (a)  and  (b)), 
the  estimated  run  time  for  Recursion  would  be  3i2+2x2  =  887,503,681.  However,  if 
Recursion  uses  the  residual  tree  (Figure  8  (c))  and  the  two  LRU  demand  layers  (Figure  8 
(a)  and  (b)),  i.e. ,  replacing  Equations  (38),  (39)  and  (40)  with  Equations  (43),  (41) 
and  (42),  respectively,  the  run  time  can  be  estimated  by  ll2  x  72x2  =  2,522.  Clearly, 
Recursion’s  performance  is  substantially  improved. 
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4  Approaches  to  MaxRho 


In  MaxRho,  the  goal  is  determine  a  set  of  repair  operations  to  maximize  the  fleet 
reliability  level  such  that  the  total  operating  cost  is  within  the  limited  budget.  However, 
in  most  military  situations,  if  repair  operations  have  direct  effects  on  fleet  reliabilities, 
then  there  won’t  be  strict  budget  limit  on  operating  costs.  For  example,  in  CAF, 
contingency  funds  are  always  available  for  unexpected  maintenance/repair  operations 
as  long  as  there  is  a  need.  Under  this  consideration,  the  scenario  tree  presentation  of 
MaxRho  (denoted  by  MaxRhoSTM)  is  a  means  of  determining  over  a  finite  number 
of  decision  periods  the  best  repair  operations  such  that  the  fleet  reliability  level  is 
maximized  and  the  expected  value  of  the  total  operating  cost  is  under  the  limited 
budget.  Note  that  MinCostSTM  presented  in  Equations  (30),  (31),  (32),  (33),  (34) 
and  (35)  is  to  minimize  the  expected  value  of  the  total  operating  cost  with  the  fleet 
reliability  satisfying  the  designated  level.  It  is  obvious  that  for  the  same  maintenance 
system,  MaxRhoSTM  and  MinCostSTM  share  the  same  scenario  tree  structure. 


Let  T)(?jm)(p)  denote  the  Lm  demand  by  node  j  G  V  with  respect  to  required  fleet 
reliability  level  p,  where  rn  —  1,2, •••  ,M  and  0  <  p  <  1.  Note  that  the  value  of 
D(j,m)  used  in  MinCostSTM  is  obtained  based  on  the  required  reliability  level  A, 
i.e. ,  Using  the  calculation  procedures  described  in  Section3.1, 

D(j,m)  (p)  can  be  easily  obtained  for  any  given  p  value.  Note  that  these  calculations 
cannot  be  presented  explicitly  as  general  mathematical  formulas,  and  therefore  any 
mathematical  programming  problem  with  D^j  m^  (p)  as  intermediate  variables  is  difficult 
to  solve.  In  the  following  formulation,  MaxRhoSTM  is  to  maximize  p  and  includes 
D(j,m){p)  as  Parfs  of  the  constraints.  This  shows  that  MaxRhoSTM  is  mathematically 
intractable.  Special  solution  methodologies  will  be  developed  for  it. 


max: 

s.t.: 


P 

j,rn)(P )  A  'y  ]  x(i,m)i 

i£V(a(j)):jeV(k)  where  fceC(i,d(i>m)) 

(  M  \ 

I  hjyj  +  rmx(j,m)  ) 

(1  +  aOW-1 

—  9(1,  m)  'z’  !  b(i,m)  i 


(44) 

(45) 

(46) 

(47) 

(48) 


0  <p  <  1 

(49) 

x(j,m)  are  non-negative  integers,  Vj,m 

(50) 

yj  are  binary  variables,  Vj, 

(51) 
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where  (44)  shows  that  the  goal  is  to  maximize  the  fleet  reliability  level  and  (49) 
constrains  the  level  in  a  feasible  range  [0,1].  Constraints  (45)  and  (46)  describe 
the  demand  constraints  (which  reflect  the  designated  fleet  reliability  level)  and  the 
budget  constraint  (which  requires  the  expected  value  of  the  total  operating  cost), 
respectively.  All  other  constraints  (47),  (48),  (50)  and  (51)  have  the  same  meanings  as 
in  MinCostSTM. 

As  discussed  above,  it  is  unlikely  that  MaxRhoSTM  can  be  solved  using  general 
optimization  approaches.  In  this  case,  combinatorial  optimization  approaches  (e.g., 
dynamic  programming  algorithms)  might  be  useful  in  solving  MaxRhoSTM.  Generally, 
dynamic  programming  algorithms  enumerate  and  compare  all  candidate  solutions  to 
find  out  the  solution  with  the  largest/smallest  objective  values  as  the  optimal  solution 
to  maximization/minimization  problems.  As  a  requirement,  there  should  be  a  finite 
number  of  value  options  for  decision  variables.  In  MaxRhoSTM,  however,  p  G  [0, 1] 
gives  unlimited  value  options  for  p.  It  is  unlikely  that  MaxRhoSTM  can  be  optimally 
solved  using  any  dynamic  programming  algorithm.  In  the  remaining  of  this  section, 
the  focus  is  to  develop  approximation  algorithms  for  MaxRhoSTM. 

The  approximation  algorithm  can  be  structured  as  a  general  binary  search  procedure, 
where  Recursion  is  repeatedly  called  to  solve  a  series  of  MinCostSTM  problems.  In 
these  MinCostSTM  problems,  designated  reliability  levels  are  dynamically  updated. 
Let  MinCostSTM (p)  denote  the  MinCostSTM  problem,  whose  fleet  reliability  level  is 
designated  to  be  p.  In  particular,  MinCostSTM  (A)  is  an  alternative  of  MinCostSTM  as 
the  required  fleet  reliability  level  for  MinCostSTM  is  A.  In  each  iteration  of  the  binary 
search,  an  on-hand  p  value  is  used  to  determine  a  particular  MinCostSTM(p)  and 
Recursion  is  called  to  find  the  solution.  If  a  feasible  solution  is  found  and  the  objective 
value  is  within  the  limited  budget  (i.e. ,  the  operating  budget  constrained  in  (46)  is 
large  enough  to  provide  the  fleet  reliability  leve  p),  then  the  result  is  acceptable  and  p 
is  updated  to  a  larger  value  for  next  iteration;  otherwise  it  concludes  that  the  result  is 
non- acceptable  (i.e.,  the  operating  budget  constrained  in  (46)  is  not  enough  to  provide 
the  fleet  reliability  level  p)  and  p  is  updated  to  a  smaller  value  for  next  iteration. 

Let  p*  and  p£  be  the  optimal  and  approximation  solution  values  of  MaxRhoSTM, 
respectively.  Since  p*  is  maximized  in  MaxRhoSTM,  p£  <  p*  is  obvious.  In  general, 
there  are  two  ways  to  measure  the  quality  of  p£:  (1)  the  relative  measure  p£t  > 
e^1  x  p*  (where  £t  >  1  is  the  ratio  indicator  and  p*  G  [ pet,£t  x  p£t]  is  determined) 
and  (2)  the  absolute  measure  p£n  >  p*  —  en  (where  en  >  0  is  the  range  indicator  and 
p*  G  [p£n , p£n  +  £n)  is  determined).  Based  on  this,  two  approximation  algorithms, 
BinRecT  and  BinRecN,  are  developed  to  determine  the  approximation  intervals  for  p 
with  respect  to  £t  and  en,  respectively. 

Let  U  and  L  be  the  upper  and  lower  bound  of  p* ,  respectively.  Let  t—j;  and  n  =  U  —  L 
be  the  bound  ratio  and  range,  respectively.  Using  the  assumption  that  there  is  at 
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least  one  feasible  solution  to  MaxRhoSTM,  it  is  for  sure  that  there  is  at  least  one 
failed  PE  which  could  be  recovered  using  the  limited  budget.  This  gives  the  initial 
lower  bound  L  =  T^+N  ■  Using  the  initial  upper  bound  U  —  1,  the  initial  bound  ratio 

and  range  are  t  —  TK  +  N  and  n  =  1  •  respectively.  In  BinRecT  and  BinRecN, 

t  and  n  are  repeatedly  updated  and  compared  to  £t  and  en,  respectively.  BinRecT 
and  BinRecN  stop  as  t  <£t  and  n  <  en,  respectively.  First,  BinRecT  is  presented. 

Algorithm  BinRecT 

[Preparation]:  Initialize  the  searching  parameters  and  determine  the  stop  criteria. 

•  Set  L  =  ,  U  =1,  t  —  TK  +  N  and  p{et)  =  L. 

•  Determine  the  value  of  £t,  where  £t  is  required  to  be  in  (l,i). 

[BinarySearch] :  If  t  <  £t,  then  go  to  [FinalCheck],  Otherwise  do: 

•  Set  p  =  and  determine  D^m^{p)  for  all  j  E  V  and  m  =  1,2,  •  •  •  ,M. 

•  Run  Recursion  to  solve  MinCostSTM(p). 

—  If  the  returned  result  is  non-acceptable,  then  set  U  =  p. 

—  If  the  returned  result  is  acceptable,  then  set  L  =  p  and  p{et)  =  L. 

•  Calculate  t  —  ^  and  go  to  [BinarySearch], 

[FinalCheck]:  Determine  D(ym)(U)  for  all  j  G  V  and  m  —  1,2, •••  ,M,  and  run 
Recursion  to  solve  MinCostSTM(f/). 

•  If  the  returned  result  is  non-acceptable,  then  go  to  [solution], 

•  If  the  returned  result  is  acceptable,  then  set  p(£t)  =  U. 

[Solution]:  Let  p£t  =  p(£t)  be  the  found  fleet  reliability  level  and  trace  back  to  obtain 
the  corresponding  repair  decisions. 

Theorem  2.  For  MaxRhoSTM,  BinRecT  can  find  an  (cfi1  —  ratio) -approximation 
solution  such  as  p£t  >  cj1  x  p*  in  0(\\oget{T K  +  IV)]  |V|2M+2)  time. 

Proof.  In  [Preparation],  the  initial  setting  for  p{et)  =  Tj>+N  is  due  to  the  assumption 
that  MaxRhoSTM  has  at  least  one  feasible  solution  which  provides  the  reliability 
level  at  least  T^+N ■  In  [BinarySearch],  the  bound  interval  is  cut  into  half  after  each 
iteration,  by  either  increasing  L  or  decreasing  U.  p(£t)  is  updated  only  when  an 
acceptable  result  is  returned.  After  [FinalCheck],  the  bound  interval  is  featured  such 
as  p*  E  [p(£t),£t  x  p{£t)\\  that  is  the  found  solution  p£t  =  p(£t)  >  Efi1  X  p*. 

Considering  time  complexity,  in  each  iteration  the  run  time  is  dominated  by  Recursion, 
which  runs  in  0(|V|2M+2)  time  to  solve  MinCostSTM(p).  Since  BinRecT  conducts 
binary  search  and  calls  Recursion  at  most  f loget  (TK  +  A)]  +  1  times  (including 
[FianlCheck]),  the  overall  run  time  is  O ( [ log£i (T K  +  iV)]  |V|2M+2).  □ 
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In  BinRecT,  the  qualities  of  found  solution,  i.e.,  the  value  of  approximated  bound 
ratio  (t*),  can  be  pre-estimated  by  setting  the  values  of  £t-  It  is  obvious  that  £i  —  1 
gives  p£t  >  cf1  x  p*  =  p* .  This  implies  that  p£t  =  p*  and  therefore  BinRecT  is 
an  optimal  algorithm.  However,  such  an  optimal  solution,  which  requires  a  pre¬ 
determined  approximation  parameter  £t  =  1,  requires  infinite  run  time  for  BinRecT, 
i.e.,  log i(TK  +  N)  =  oo.  It  is  impossible.  This  shows  that  using  BinRecT  solution 
qualities  can  be  bought  by  run  times  but  MaxRhoSTM  cannot  be  solved  optimally. 
Next,  BinRecN  is  designed  under  a  similar  algorithm  structure.  The  above  solution 
accuracy  and  algorithm  run  time  arguments  are  valid  for  BinRecN  as  well. 

Algorithm  BinRecN 

[Preparation]:  Initialize  the  searching  parameters  and  determine  the  stop  criteria. 

•  Set  L  =  U  =l,n  =  and  p{en)  =  L. 

•  Determine  the  value  of  £n,  where  £n  is  required  to  be  in  (0,n). 

[BinarySearch] :  If  n  <  £n ,  then  go  to  [FinalCheck],  Otherwise  do: 

•  Set  p  =  and  determine  D(j:m) (p)  for  all  j  e  V  and  m  —  1,2,  •  ■  •  ,M. 

•  Run  Recursion  to  solve  MinCostSTM(p). 

—  If  the  returned  result  is  non-acceptable,  then  set  U  =  p. 

—  If  the  returned  result  is  acceptable,  then  set  L  =  p  and  p(en)  =  L- 

•  Calculate  n  =  U  —  L  and  go  to  [BinarySearch]. 

[FinalCheck]:  Determine  D^j^{U)  for  all  j  G  V  and  m  —  1,2,- ,M,  and  run 
Recursion  to  solve  MinCostSTM(f/). 

•  If  the  returned  result  is  non-acceptable,  then  go  to  [solution], 

•  If  the  returned  result  is  acceptable,  then  set  p(en)  =  U. 

[Solution]:  Let  p£n  =  p(£n)  be  the  found  availability  level  and  trace  back  to  obtain 
the  corresponding  decision  strategies. 

Theorem  3.  For  MaxRhoSTM,  BinRecN  can  find  an  (en  —  range)  -approximation 
solution  such  as  p£n  >  p*  —  £n  in  0([lo g£n  |V|2M+2)  time. 

Proof.  Similarly  to  the  proof  of  Theorem  2,  BinRecN  exits  with  p*  e  [p(£n),p(£n)  +£n )• 
This  gives  the  found  solution  p£n  =  p(£n)  >  p*  —£n-  hr  order  to  reduce  the  range  from 
TtR+n 1  ^,0  £n'1  BinRecN  calls  Recursion  at  most  |"l°ge„  (^2^+^)"!  Buies.  Since 
in  each  iteration  Recursion  dominates  the  searching  operation  and  Recursion  runs 
0(|V|2M+2)  time  for  each  MinCostSTM(p),  the  overall  time  complexity  of  BinRecN 

^  o(\\ogEn  (Iggr)l  |v|2M+2).  □ 
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5  Final  remarks 


In  this  report,  an  analytical  model,  MaxRho  was  developed  for  multi-stage  repair 
decision  making  in  a  single-base,  single-depot  military  maintenance  network,  where 
cannibalization  operations  are  allowed  and  well-managed  in  the  operating  base  and 
repair  operations  are  conducted  in  the  depot.  The  goal  was  to  maximize  the  overall  fleet 
reliability  level  over  a  finite  decision  horizon,  while  the  total  operating  cost  is  within 
the  limited  budget.  Taking  into  account  most  real  situations,  MaxRho  included  two 
uncertain  factors:  the  independent  failures  and  the  out-of-control  repair  operations,  i.e., 
the  times  and  the  costs.  Under  peaceful  conditions  operating  managers  are  encouraged 
to  dynamically  find  out  which  decision  is  best,  MaxRho  can  be  used  to  determine  the 
most  appropriated  repair  operations  for  any  determined  time  periods  such  that  the 
total  operating  costs  are  within  the  limited  budget  and  the  fleet  reliabilities  achieve 
the  designated  level. 

To  solve  MaxRho,  a  complementary  minimization  problem,  MinCost  was  developed, 
where  the  goal  was  to  minimize  the  total  operating  costs  with  constraints  on  fleet- 
reliabilities.  Instead  of  solving  MinCost  directly,  this  report  first  re- formulated  MinCost 
as  a  deterministic  problem  on  a  scenario  tree  (MinCostSTM),  and  then  developed  a 
dynamic  programming  algorithm  (Recursion)  for  MinCostSTM.  Finally,  it  was  proved 
that  Recursion  could  find  an  optimal  solution  to  MinCostSTM  in  a  polynomial  time 
with  respect  to  the  size  of  the  scenario  tree.  In  addition,  by  conducting  more  detailed 
analysis  on  the  scenario  tree,  it  was  showed  via  a  sample  problem  that  Recursion 
could  run  much  faster  than  the  claimed  run  time. 

Instead  of  directly  dealing  with  MaxRho,  this  report  developed  two  approximation 
algorithms,  BinRecT  and  BinRecN  to  solve  MaxRhoSTM  (the  scenario  tree  presenta¬ 
tion  and  the  deterministic  equivalence  of  MaxRho) .  Both  algorithms  implemented 
binary  search  to  iteratively  narrow  the  intervals,  which  included  the  unknown  optimal 
solution  value  of  MaxRhoSTM.  In  BinRecT  and  BinRecN,  the  found  solutions  were 
within  the  relative  (ratio)  and  absolute  (range)  measured  neighborhoods  of  the  un¬ 
known  optimum,  respectively.  Moreover,  it  was  proved  that  the  run  times  of  BinRecT 
and  BinRecN  would  be  longer  if  the  ratio  and  range  neighborhood  parameters  were 
designated  for  more  accurate  solutions. 

This  report  conducted  theoretical  studies  for  making  detailed  multi-stage  repair 
decisions  in  military  maintenance  systems  with  cannibalization.  As  a  future  work,  more 
applicable  solution  approaches  for  close-to-real  problems  deserve  researchers’  efforts, 
i.e.,  the  GAMS-CPLEX  combined  approach  to  large-scale  stochastic  optimization 
problems.  Other  research  directions  could  be  to  extend  the  maintenance  system 
from  a  single-based  network  to  multi-based,  and/or  to  allow  more  indenture  levels  in 
identifying,  repairing  and  cannibalizing  failed  parts. 
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CAF 
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HPP 

IID 

LORA 

LRU 

MaxRlio 

MaxRhoSTM 

MDP 

MinCost 

MinCostSTM 

MSO 

PE 

Recursion 

SDP 

STB 

SPS 

STM 

US  DoD 

The  Range-Approximated  Algorithm  for  MaxRhoSTM 
The  Ratio-Approximated  Algorithm  for  MaxRhoSTM 
Canadian  Armed  Forces 

General  Accounting  Office  of  the  United  States 
Homogeneous  Poison  process 

Independent  and  Identically  Distributed 

Level  of  Repair  Analysis 

Line  Replaceable  Unit 

The  Fleet-reliability-maximization  Model 

The  STM  Presentation  of  MaxRlio 

Markov  Decision  Process 

The  Operating-cost-minimization  Model 

The  STM  Presentation  of  MinCost 

Multi-stage  Stochastic  Optimization 

Prime  Equipment 

The  Algorithm  for  MinCostSTM 

Stochastic  Dynamic  Programming 

Scenario  Tree  Based 

Spare  Parts  Stocking 

Scenario  Tree  Model 

United  States  Department  of  Defense 
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Annex  A:  Notations  I 


T  There  are  T  mutually-disjointed  time  periods  [t,t  +  1),  where  t  =  1,2, •  •  •  ,T. 
En  At  time  one,  there  are  N  failed  PEs,  denoted  by  E\,E2,---  ,  -E/v- 
Lm  Each  PE  is  made  up  of  M  different  LRUs,  denoted  by  Li,L,2,---  ,Lm- 
sm  The  number  of  spare  Lm,  which  are  individually  existed  in  the  system. 
fm  The  number  of  functioning  Lm  at  time  one. 
gm  The  number  of  malfunctioning  Lm  at  time  one. 

qm  The  number  of  Lm  due  in  (under  repair  or  in  transshipment)  at  time  one. 
rr  The  number  of  failed  PEs  (or  PE  frames)  at  time  t,  where  t  =  1,2,  •  •  •  ,T. 

nt  The  number  of  recovered  PEs  during  time  period  t,  where  t  =  1,2,  •  •  •  ,T. 

ft  The  number  of  functioning  LRUs  at  time  t,  where  r  =  (f\ , /|,  ■  ■  ■  , .I'm)- 

The  number  of  malfunctioning  LRUs  at  time  t,  where  =  (g\,g2,  •  •  ■  ,g )■ 

q*  The  number  of  due-in  LRUs  at  time  t,  where  r  =  (q\,  ■■  •  ,  q\[)- 

M 

km  The  number  of  failed  Lm  of  a  PE  failure  with  k  =  J2  km  failed  PEs. 

m=  1 

k  The  M-entry  vector  to  denote  a  PE  failure,  i.e. ,  k  =  (ki,k2,-  ■  ■ ,  /cm)- 
A  It  takes  in  average  \  time  units  for  a  functioning  PE  to  get  failed. 

M 

The  probability  of  a  failed  PE  due  to  a  failed  Lm,  where  1  =  Pm- 

m=  1 

7  The  threshold  probability  to  specify  zero-probability  PE-failure  events. 

K  The  largest  number  of  failed  PEs  defined  by  7  such  as  —  >  7. 

The  decision  variable  to  denote  Lm-repair  operations  at  time  t. 

The  vector  to  denote  repair  operations  at  time  t,  i.e.,  =  (. x\,x\ , ,^m)- 

d ^  The  lead  time  for  sending  a  failed  Lm  to  the  depot  for  repair  at  time  t. 
a  The  interests  gained  on  per  un-used  fund  per  time  period. 
rm  The  variable  cost  for  each  Lm  repair  operation,  which  is  time- independent. 
ht  The  fixed  cost  for  transporting  one  or  more  LRUs  to  the  depot  at  time  t. 
cL  The  total  operating  cost  of  period  t,  where  t  —  1,2,  •  ■  ■  ,T. 

pt  The  recovering  ratio  of  the  first  t  periods,  where  t  =  1,2,  •  •  •  ,T. 

A  The  required/designated  level  of  fleet  reliability  in  MinCost. 

B  The  funds  for  repair  and  transportation  operations  in  MaxRho. 
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Annex  B:  Notations  II 


T 

The 

V 

The 

r 

The 

T(j) 

The 

V(j) 

The 

l(j) 

The 

m 

The 

a(j) 

The 

V(j) 

The 

C(j,l ) 

The 

V(j) 

The 

Pj 

The 

The 

D  j 

The 

The 

D  j 

The 

K? 

The 

The 

x(j,m) 

The 

Vj 

The 

The 

rm 

The 

h3 

The 

A'(],m) 

The 

B_(j,m) 

The 

B(j,m) 

The 

Tm(j ) 

The 

VmU) 

The 

Tr(j) 

The 

Vr(j) 

The 

St 

The 

Sn 

The 

notation  for  a  scenario  tree  with  a  default  root  node  1. 

set  of  all  nodes  on  T,  where  index  1  is  reserved  for  root  node. 

number  of  PE  failures  or  edges  branched  from  a  non-leaf  node. 

subtree  rooted  on  a  non-leaf  node  j.  In  particular,  T(l)  =  T. 

set  of  all  nodes  on  T(j).  In  particular,  V(l)  =  V. 

level  of  node  j  with  1  <  l(j)  <  T+  1,  Vj  G  V. 

set  of  nodes  with  level  /,  i.e. ,  C(l )  =  {j\l(j)  =  l,j  €  V}. 

direct  ancestor  of  node  j.  In  particular,  a(l)  =  0. 

set  of  direct  descendant  nodes  of  any  non-leaf  node  j. 

set  of  nodes,  which  are  in  V (j )  with  /  levels  higher  than  node  j. 

path  from  node  1  to  node  j  including  nodes  1  and  j. 

probability  of  node  j.  In  particular,  p\  —  l. 

Lm  demands  on  node  j.  In  particular,  =  0  for  all  m. 

demand  vector  on  node  j,  where  D?  =  2),-" 

F-demand  of  Lm  on  node  j.  In  particular,  -D(i,m)  =  0  for  all  m. 
F-demand  vector  on  node  j  with  D?  = 

PE  failure  denoted  by  the  edge  heading  into  node  j. 

number  of  failed  Lm  of  the  PE  failure  denoted  by  k; . 

decision  variable  denoting  the  Lm-repair  operations  on  node  j. 

decision  variable  denoting  if  there  are  any  LRU  repairs  on  node  j. 

required  lead  time  for  Lm-repair  operations  on  node  j. 

variable  cost  for  each  Lm-repair  operation  on  all  nodes. 

fixed  cost  for  transporting  LRUs  to  the  depot  on  node  j. 

set  including  all  mLeaf  nodes  in  V(j). 

set  including  all  mGRoot  nodes  in  V(j). 

set  including  all  mERoot  nodes  in  V(j). 

Lm-demand  layer  rooted  on  node  j,  e.g.,  7^(1)  =  Tm. 
set  of  nodes  on  Tm{j ),  e.g.,  Vm(l)  =  Vm. 
residual  tree  of  T(j),  e.g.,  Tr(1)  =  Tr. 
set  of  nodes  on  TrU),  e.g.,  Vij(l)  =  Vr. 
approximation  ratio,  which  can  be  used  as  stop  criteria, 
approximation  range,  which  can  be  used  as  stop  criteria. 
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